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Abstract. We investigate the asymptotic behavior of the Nadaraya- Watson esti- 
mator for the estimation of the regression function in a semiparametric regression 
model. On the one hand, we make use of the recursive version of the sliced inverse 
regression method for the estimation of the unknown parameter of the model. 
On the other hand, we implement a recursive Nadaraya- Watson procedure for the 
estimation of the regression function which takes into account the previous esti- 
mation of the parameter of the semiparametric regression model. We establish the 
almost sure convergence as well as the asymptotic normality for our Nadaraya- 
Watson estimator. We also illustrate our semiparametric estimation procedure on 
simulated data. 



The goal of this paper is to investigate the asymptotic behavior of the Nadaraya- 
Watson estimator of the regression function / in the semiparametric regression 
model given, for all n > 1, by 



where (X n ) is a sequence of independent and identically distributed random vectors 
of W and the driven noise (e n ) is a real martingale difference sequence independent 
of (X n ). We assume in all the sequel that the unknown p-dimensional parameter 
6 7^ 0. On the one hand, we make use of the recursive version of the sliced inverse 
regression (SIR) method, originally proposed by Li [TT] and Duan and Li [7], in order 
to estimate 9. On the other hand, we estimate the unknown regression function / 
via a recursive Nadaraya- Watson estimator which takes into account the previous 
estimation of the parameter 6. Our purpose is precisely to investigate the asymptotic 
behavior of the recursive Nadaraya- Watson estimator of /. 

One can find a wide range of literature on nonparametric estimation of a regression 
function. We refer the reader to [6], [13], [19], [21] for some excellent books on density 
and regression function estimation. In the classical situation without any parameter 
9, the almost sure convergence of the Nadaraya- Wat son estimator [T2"] . [22] was 
proved by Noda [IS] and its asymptotic normality was established by Schuster [T8] . 
Moreover, Choi, Hall and Rousson [4] propose three data-sharpening versions of the 
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Nadaraya-Watson estimator in order to reduce the asymptotic variance in the central 
limit theorem. In our situation, we propose to make use of a recursive Nadaraya- 
Watson estimator [8] of / which takes into account the previous estimation of the 
parameter 9. It is given, for all x G M p , by 

E n k=1 W k (x)Y k 

1 k( x ~®™- lXn ) 

where the kernel K is a chosen probability density function and the bandwidth (h n ) 
is a sequence of positive real numbers decreasing to zero, such that nh n tends to 
infinity. For the sake of simplicity, we propose to make use of h n = l/n a with 
a G ]0, 1 [. The main difficulty arising here is that we have to deal with the recursive 
SIR estimator 9 n of 9 inside the kernel K. 

The paper is organized as follows. Section 2 is devoted to the recursive SIR estimator 
9 n . Our main results on the asymptotic behavior of f n are given in Section 3. 
Under standard regularity assumptions on the kernel K, we establish the almost 
sure pointwise convergence of /„ together with its asymptotic normality. Section 
4 contains some numerical experiments on simulated data, illustrating the good 
performances of our semiparametric estimation procedure. All the technical proofs 
are postponed in Appendices A and B. 



(1.2) 
with 



fn(x) 



WJx) 



2. ON THE RECURSIVE SIR METHOD 

From the seminal work of Li (TTj and Duan and Li [7j devoted to the SIR theory, 
we know that the eigenvector associated with the maximum eigenvalue of the matrix 
E _1 r is collinear with 9 where £ = V(X n ) is positive definite, V = V(E,(X n \T(Y n )) 
and T is a slicing of the range of Y n into H non overlapping slices s±, ■ ■ ■ ,sh- One can 
observe that since the link function / is unknown in the semiparametric regression 



model (1.1), the parameter 9 is not entirely identifiable. Only its direction can be 
identified without assuming additional constraints. Li [TTJ called effective dimension 
reduction (EDR), any direction collinear with 9. Moreover, the SIR theory mainly 
relies on the so-called linear condition (LC) which imposes that for all b G M. p , 
E,[b'X n \9'X n ] is linear in 9'X n . It means that one can find a, (3 G M such that 

(LC) E[b'X n \9'X n ] = a + p9'X n . 

This condition is required to only hold for the true parameter 9. Since 9 is un- 
known, it is not possible in practice to verify it a priori. Hence, we can assume that 
(LC) holds for all possible values of 9, which is equivalent to elliptical symmetry of 
the distribution of the identically distributed sequence (X n ). Finally, Hall and Li 
[TO] mentioned that (LC) is not a severe restriction because (LC) holds to a good 
approximation in many problems as the dimension p of the regression vector X n 
increases. Chen and Li [S] or Cook and Ni [3] also provide interesting discussions 
on the linear condition. 
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In order to obtain a recursive version of an EDR direction estimated with SIR 
approach, we need an analytic expression of the maximum eigenvector of E _1 T. It 
is easily tractable when the range of Y n is divided into two non overlapping slices s\ 
and S2- Hereafter we shall assume that H = 2. In this special case, it is not hard 
to see that T = p x z\ + P2Z2 where p h = P{Y n G s h ) and z h = E[X n \Y n G s h ] - E[X n ] 
with ph 7^ for h — 1,2. Moreover, it is straightforward to show that the eigenvector 
associated to the maximum eigenvalue of E _1 T can be written as 

e = T,-\z l -z 2 ). 

This vector 9 is therefore an EDR direction. For the sake of simplicity, we identify 
in all the sequel the EDR direction 9 with 9. Our purpose is now to propose an 
estimator of the EDR direction 9. First of all, let us recall the non recursive SIR 
estimator 9 n of 9 given by Nguyen and Saracco [Hj. The estimator 9 n can be 
easily obtained from the sample (Xi, Yi), . . . , (X n , Y n ) by substituting the theoritical 
moments by their sample couterparts. More precisely, 9 n is given by 

(2.1) 9 n = K\^,n - Z 2 ,n) 

where 

j n 1 71 

(2.2) s n = — y (x k — x„)(x k — x n ) , x n = — y x k 

n L — 4 n L — 4 

fc=l k=l 

and, for h — 1, 2, z h>n = m h n — X n where 

Y n n 

(2.3) m h)n = 22X k l{Y h es h }, n h,n = ^2 l {Y k es h }- 

ah ' n k=l k=l 

Next, we focus our attention on the recursive SIR estimator 9 n of 9 proposed by 
Bercu, Nguyen and Saracco [I], [Ej. We split the sample into two parts: the 
subsample of the first (n — 1) observations (Xi,Yi), . . . , (X n -i, Y n _x), and the new 



observation (X n , Y n ). On the one hand, the inverse of the matrix E n given by (2.4) 
may be recursively calculated via the Riccati equation [S], 

(2 ' 4 > E "' = ^T E »- - (B -i)" B + A j ^^ 

where p n = ^^E^l^n and $ n = X n — X n ^%. On the other hand, we can also obtain 
the recursive form of z/, n . As a matter of fact, we have for h = 1,2, 



(2.5) z h . 



Z h *,n-1 ~ -$n + if h = h*, 

U Uh^n-x + 1 

Zhn-i - -®n otherwise, 
n 
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where h* denotes the slice containing the observation Y n and &h* n — X n — rrih* n -i- 



We deduce from (2.4) and (2.5) that the recursive SIR estimator 6 n is given by 

^ = (^1 ) Cl - (n - 1) (n + Pn ) 
' (-l)^n / , 1 



(n ft *, n _i + l)(n - 1) 



foi - -J-^M^i) 



The SIR estimators 6 n and # n share the same asymptotic properties, previously 
established in [H], under the following classical hypothesis. 

(Hi) The random vectors (X n ) are square integrable, independent and identically 
distributed and (Xl, Yi), . . . , (X n , Y n ) are independently drawn from (1.1 ). 

Lemma 2.1. Assume that (LC) and (Hi) hold. Then, 6 n converges a.s. to 6, 

log ( log n) 



(2.7) ||gn-eil a = o( 6V rc M 

In addition, we also have the asymptotic normality 

(2.8) yfn{6 n -0) -^N(0,A) 

where the limiting covariance matrix A may be explicitely calculated. 

3. MAIN RESULTS 

Our purpose is to investigate the asymptotic properties of the recursive Nadaraya- 
Watson estimator /„ of the link function / given by (1.2). First of all, we assume 
that the kernel K is a positive symmetric function, bounded with compact support, 
twice differentiable with bounded derivatives, satisfying 

/ K(x) dx — 1 and / K 2 (x)dx = u 2 . 

Jr Jr 

Moreover, it is necessary to add the following standard hypothesis. 

(H 2 ) The probability density function g associated with (X n ) is continuous, posi- 
tive on all M. p , twice differentiable with bounded derivatives. 
(H 3 ) The link function / is Lipschitz. 

Our first result deals with the almost sure convergence of the estimator f n . 

Theorem 3.1. Assume that (LC) and (Hi) to (H 3 ) hold. In addition, suppose that 
the sequence (X n ) has a finite moment of order a > 2. Then, for any x G M, we 
have 

(3.1) lim f n (x) = f(x) a.s. 

n— >co 

More precisely, if the bandwidth (h n ) is given by h n = l/n a with < a < 1/3, 

(3.2) f n (x) - f(x) = O (n-) + o(„vyi^) a.s. 
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while, if 1/3 < a < 1, 



(3.3) f n (x) - f(x) = O (vn^logn) + o(n 1/a \j log ^ gn ^ a .s. 

Proof. The proof is given Appendix A. □ 

Remark 3.1. In the particular case where (X n ) is a sequence of independent random 
vectors of MP sharing the same J\f(m, E) distribution where the covariance matrix E 



is positive definite, we can replace n 1 ' 0, by logn into (3.2) and (3.3). Consequently, 
for any x G M., we obtain that if < a < 1/3, 

f n (x) - f{x) = O {n- a ) a.s. 

while, if 1/3 < a < 1, 

f n (x) - f(x) = O (^y/n^ 1 lognj a.s. 

The asymptotic normality of the estimator f n is as follows. 

Theorem 3.2. Assume that (LC) and (Hi) to (H 3 ) hold. In addition, suppose that 
the sequence (X n ) has a finite moment of order a = 6 and that the sequence (e n ) 
has a finite conditional moment of order b > 2. Then, as soon as the bandwidth 
(h n ) satisfies h n = l/n a with 1/3 < a < 1, we have for any x G M, the pointwise 
asymptotic normality 

(l + a)h(6,x) 

where h(6,x) stands for the probability density function associated with (8'X n ). 
Proof. The proof is given Appendix B. □ 



(3.4) VnM/„ W - /(,)) A U(0, (1+ a Z<eJ 



4. NUMERICAL SIMULATIONS 

The goal of this Section is to illustrate via some numerical experiments the theo- 
retical results of Section 3. We will provide the numerical behavior of our recursive 
estimators combining the recursive Nadaraya- Watson estimator of the link function 
/ together with the recursive SIR estimator of the parameter 9. First of all, we de- 
scribe in Section 4.1 the simulated model used in the numerical study and we present 
the estimation procedure, in particular the choice of the bandwidth parameter a by 
a cross-validation criterion. Then, we illustrate in Sections 4.2 and 4.3 the almost 
sure convergence and the asymptotic normality of our recursive Nadaraya- Watson 
estimator of /. 

4.1. Simulated model and estimation procedures. We consider the semipara- 
metric regression model given, for all n > 1, by 

(M) Y n = f(6'X n )+e n 

where the link function / is defined, for all x G K, by 

/3x" 



f(x) = xexpl 



V 4 
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The parameter 6 belongs to W with p = 10 and is given by 

= -j!=fl,2,-2,-l,O,...,O 



Moreover, (X n ) is a sequence of independent random vectors of M? sharing the same 
Af(0, l p ) distribution, while (e n ) is a sequence of independent random variables with 
standard W(0, 1) distribution, independent of (X n ). In Figure 4.1| we present two 
scatterplots for a sample of size n = 1000 generated from model (M). On the left side, 
one can observe the data in the "true" reduction subspace, that is the scatterplot of 
(PX^Yi), {6'X n ,Y n ) based on the "true" EDR direction 0. On the right side, 
we plot the data obtained from the estimated EDR direction 9 n calculated via our 
recursive SIR procedure, that is the scatterplot of (O^Xx, Y\), . . . , {Q' n X n , Y n ). One 
can clearly notice that the EDR direction has been well estimated. 



>- 




-3-2-10123 -3-2-10123 



True index Estimated index 

Figure 4.1. 

Scatterplots of (e'X 1 ,Y 1 ), (9'X n , Y n ) and {6^X X , Y 1 ), (9^X n , Y n ). 

For the recursive Nadaraya- Watson estimator f n of /, we have chosen the well-known 
Epanechnikov kernel 

3 

K (x) = j(i - x2 )h\x\<i} 

and the bandwidth h n = l/n a with < a < 1. We now need to evaluate an optimal 
value for the smoothing parameter a. The problem of deciding how much to smooth 
is of great importance in nonparametric regression. We propose to make use of the 
optimal data-driven bandwidth a which minimizes the cross-validation criterion 

n 

CV{a) = ^2(Y k - Y Ka ) 2 where ? M = fk-M-iX k ). 

k=p+l 

We can observe by simulations that the CV(a) functions are all convex and the 
corresponding optimal data-driven bandwidth a lies into the interval [0.33,0.38]. 
Consequently, in all Section 4, we have chosen the optimal value a = 0.35. 
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4.2. Almost sure convergence. The good numerical performances of the recur- 
sive SIR estimator 6 n were perviously illustrated in [Tj, [Hj. In order to keep this 
section brief, we only focus our attention on the almost sure convergence of f n . We 
generate N = 1000 samples of different sizes n = 200, 500, 1000, 2000 from model 
(M) with p = 10. For each sample, we calculate the estimation f n {6' n x) of f(O'x ) for 
10 different values of x G W. The boxplots of the / n (6^x)'s are given in Figure 4.2 
The circle point in each boxplot represents the true value f(6'x) to easily judge the 
quality of the estimations. One can observe that the dispersion of the f n (9' n xys 
are small and the mean is very close to the true value f(6'x). One can also notice 
that the larger is the sample size n, the greater is the quality measure. As it was 
expected, the quality of the estimation decreases for large values of f{0'x) since the 



number of observations around x decreases, see the scatterplots of Figure 4.1 to be 
convinced. 



n 
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i 
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Figure 4.2. 

Almost sure convergence of f n (9' n x ) to f{0'x) for 10 different values of x. 
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4.3. Asymptotic normality. In order to illustrate the asymptotic normality of our 
recursive Nadaraya- Watson estimator, we generate N = 1000 realizations of f n (6' n x) 
for n = 1000 from model (M) with p = 10. In Figure 4.3, we plot the histogram of 



the standardized values of the f n (6' n xys for 2 different values of x G W. We add 
the density of the standard normal density on each histogram. One can clearly see 
that the normal density coincides pretty well with all the histograms, which visually 
illustrates the asymptotic normality of our recursive Nadaraya- Wat son estimator f n 
of/. 





Figure 4.3. 

Asymptotic normality of f n {9' n x) to f(9'x) for 2 different values of x. 



Appendix A 
PROOF OF THEOREM O 



In order to prove the almost sure pointwise convergence of Theorem 3T , we shall 
denote for all x € M 

n n 

P n (x) = W k (x)e k , N n (x) = W *(x), 



k=l 



k=l 



and 



Q n (x) = Y J W k (x)U{<$>k)- f(x)) 



k=l 



where $ n = 9'X n . We clearly obtain from the main decomposition 

Pnix) + Qn(x) 



(A.l) 



fn(x) ~ f(x) 



NJx) 
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We shall establish the asymptotic behavior of each sequence (P n (x)), (Q n (x)) and 
(N n (x)). Let (J- n ) be the filtration given by T n = cr(Xi, . . . , X n , Y±, . . . , Y n ). First 
of all, we can split N n (x) into two terms, 

(A.2) N n (x) = M^(x) + Ri N \x) 

where 

n n 

Mi N) (x) = J2(w k (x) -E[W k (x)\T k ^ and R^\x) = ^E^^)^]. 

k=l k=l 

On the one hand, we have 

E[W n (x)\F n „ 1 } = ±- [ K( X ~ 6n - lXn )g(x n )dx n . 

h-n Jrp \ rl n / 

We can assume without loss of generality that, for n large enough, at least one 
compon ent of 6 n is different from zero a.s. As a matter of fact, we already saw from 
that 6 n converges a.s. to 9 which is different from zero. For the sake of 



2.1 



Lemma 

simplicity, suppose that the first component n -i,i ^ a.s. We can make the change 
of variables 

x Q n _\X n 
z = 1 

and z 2 = x n> 2, . . . ,z p = x n>p . The Jacobian of this linear transformation is given by 



Consequently, we obtain that 

(A.3) E[W n {x)\T n ^] = [ K(z)h(9 n _ u x-zh n )dz 



where 



If 1 P 

h(6 n _ u x) = — / (x - ^ 6 n -i,kZk) , z 2} ■ ■ ■ } z p )dz 2 . . .dz p . 

\dn-l,l\ J^- 1 V #n-l,l V t^2 

One can observe that h(9,x) is exactly the probability density function associated 
with the identically distributed sequence (9'X n ). Therefore, as the probability den- 
sity function g is continuous, twice different iable with bounded derivatives, we de- 
duce from (A.3) togheter with Taylor's formula that 

E[W n {x)\F n ^} = f K(z)(h(9 n - 1 ,x)-zh n h'(e n - 1 ,x) 
Jr v 

2 1,2 

+ ^h"(6 n ^,x-zh n 0)dz, 

= h(e n . 1 ,x) + h ^ [ z 2 K{ z )h"{e n ^x-zh n i)dz 

6 .IV 
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where < £ < 1. Consequently, for n large enough, 



(A.4) 
where 



E[W n (x)\F n -!] -h(9 n -i,x) 



< M h T 2 hl a.s. 



M h = sup 



and t z 



x 2 K{x)dx. 



Hence, we find from (A.4) that 



n n 

Yjf[W k {x) \ F^} - h(0 k ^,x)\ = o(j2hi) 



a.s. 



k=l 



k=l 



It follows from the continuity of h together with the fact that 8 n converges to 9 a.s. 
and h n goes to zero that 



(A.S) 



1 n 

hm -yy.[w h (x)\T k - 1 \ = h{o, 



X 



a.s. 



k=l 



which of course immediately implies that for all x £ R 

R (N) (x) 

(A.6) lim n y ' = h{6,x) a.s. 

n->oo n 

On the other hand, (M„ (a;)) is a square integrable martingale difference sequence 
with predictable quadratic variation given by 



k=l 



J2(E[W 2 (x)\T k ^} - E 2 [W k (x)\F } 



k-l 



k=l 

Via the same change of variables as in ( A.3[ ), we obtain that 

nwi{x)\F n - x \ = ±- 

"n 
1 

K „ 

z 2 h 2 

H — ^ Ji h"(9 n -i,x - zhnC) )dz 
where < £ < 1. Consequently, for n large enough, 



K 2 (z)h{6 n _i,x — zh n )dz, 
K 2 (z)(h(6 n ^ 1 ,x) - zh n ti(6 n _i,x) 



(A.7) 
where 



E[W 2 (x)\F n ^]-—h(6 n _ 1 ,x 



< M h fi 2 h n a.s. 



/ K 2 (x)dx and /i 2 = - / x 2 K 2 \ 
Jr 2 J R 



x)dx. 
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Hence, (A. 7) ensures that 

n 2 n 



k=l 



a.s. 



k=l 



However, it is not hard to see that 



lim — — — = . 

rwoo n 1+a ^— ' h k 1 + a 

k=l K 



Therefore, it follows from (A. 7) together with the almost sure convergence of h(9 n , x) 
to h(8,x) and Toeplitz's lemma that 



i 71 2 
(A.S) lim -^^E[^ 2 (x) | Tk-i) = ^—h{6 

k=l 



X) 



a.s. 



Furthermore, we also have from (|A.4|) that 
(A.9) 



1 n 

lim - VE 2 [H/ fc (x)|J- fc _!] = h 2 (6,x) 

n— »oo Tl 



a.s. 



k=l 



Consequently, we deduce from (|A.8|) and (]A.9|) that for all i6l, 
(A.10) 



[a <^(,)>, = ^ 



/7 



1+a 



1 + a 



a.s. 



We are now in position to make use of the strong law of large numbers for martingales 
given e.g. by Theorem 1.3.15 of [8J. As the probability density function g is positive 
on its support, we have for all sGl, h(9, x) > 0, which implies that <M^ N \x) > n 
goes to infinity a.s. Hence, for any 7 > 0, (Mn N \x)) 2 = o(n 1+a (logn) 1+7 ) a.s. which 
leads to 



(A.ll) 



M[ N \x) = o{n) 



a.s. 



Then, we obtain from (|A.2|), (|A.6|) and (|A.11|) that for all x E 
(A.12) 



lim ^M = /^ )X ) 



a.s. 



n— >oo fl 



We shall now investigate the asymptotic behavior of the sequence (P n (x)). Since 
(X n ) and ( independent, (P n (x)) is a square integrable martingale difference 

sequence with predictable quadratic variation given by 

n n 

<p(x)> n = j^niPkix) - p k -i(x)) 2 \F k -i] = a 2 Yywiwfk-^ 



k=i 



k=i 



Then, it follows from convergence (|A.8|) that 
(A. 13) lim 



<P(x)>, 



a 2 u 2 



n— >oo n 



l+a 



1 + a 



h(6,x) 



a.s. 
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Consequently, we obtain from the strong law of large numbers for martingales that 
for any 7 > and that for all x £ R, 



(A. 14) P n (x) = o(V™ 1+a (log «) 1+7 ) = o(n) 



a.s. 



It remains to study the asymptotic behavior of the sequence (Q n (x)). We can split 
Q n (x) into two terms, 



(A.15) 

where $ n = Q' n _ x X n , 



Q n (x) = S n (a;) + A n (x) 



E n (x) = Y J W k {x)U^k) ~ /($ fc )) and A n (x) = ^W fc (x) (/($*) - /(re)). 



fe=i 



fe=i 



The right-hand side of (A.15) is easy to handle. As a matter of fact, the kernel K 



is compactly supported which means that one can find a positive constant A such 
that K vanishes outside the interval [—A, A}. Thus, for all n > 1 and all iGK, 

Wn[X) ~ h n V h n )hK^x n - x \<A hn y 

In addition, the function / is Lipschitz, so it exists a positive constant Cf such that 
for all n > 1 



(A.16) 



!/($„) - f(x)\ < c f \$ n -x\< c>|Ci*n - ^1 



Consequently, we obtain from (A.16) that for all x £ 



(A.17) 



|A n (x)| < CfY,W k {x)\6' k _ x X k -x\ 

k=l 

n 

< AC f J2 h kW k (x). 



k=i 



Moreover, via the same lines as in the proof of (A.5), we find that 

1 - 1 
(A.18) lim y^h,M{W k {x)\F k ^} = h(9,. 



x) a.s. 



k=l 



Furthermore, denote 

n 



k=l 



k-1 
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One can observe that (Mi A '(i)) is a square integrable martingale difference sequence 
with bounded increments and predictable quadratic variation given by 



<M( A \x)> 1 



£e[(M< a >(s)-m£>(s))V^i 

k=l 

n 



k=l 



k-1 



Hence, it follows from (|A.4|) and (|A.7|) together with the almost sure convergence of 

-h(9,x) 



h(6 n ,x) to h(9,x) and Toeplitz's lemma that 

<M( A )(a;)> n v 2 



(A.19) 



lim 

n— >oo 



n 



l-ot 



1 — a 



a.s. 



Consequently, we obtain from the strong law of large numbers for martingales that 



(A.20) 



(il4 A) (x)) 2 = o(n 1_a log7i 



a.s. 



Then, we infer from the conjunction of (|A.17|), (|A.18|) and (|A.20|) that for all x 6 
(A.21) 



|A n (x) 



On 



,1-Q 



a.s. 



The left-hand side of (A.15) is much more difficult to handle. We can use once again 
the assumption that the function / is Lipschitz to deduce that it exists a positive 
constant Cf such that for all n > 1 



(A.22) 

where n r 
x e R 

(A.23) 
Denote 



|/($„)-/($„)| <c>k 



i — 6)'X n . Hence, it immediately follows from (A.22) that for all 

n 

\Z n (x)\<C f J2w k (x)\n k \. 



k=l 



' n _iX n - x\ < Ah n \ and B n = \ \6'X n - x\ < Ah n + b n 



A = 

where (b n ) is a sequence of positive real numbers which will be explicitely given 
later. On the one hand, we immediately have from the triangle inequality that on 
the set A n fl B n , 

\ir n \ < 2Ah n + b n . 
On the other hand, we also have on the set A n fl B n , 

Ah n + K < \6'X n - x\ < |7T n | + \9n-iX n - x\ < \lT n \ + Ah n 
which implies that \n n \ > b n . Consequently, we obtain from ( |A.23 ) that 

n n n 

(A.24) |E n (x)| <2AC f J2h k W k (x)+C f J2hW k (x)+C f J2Wk(x)\7r k \I { ^ 



k=l 



k=l 



k=l 
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We already saw from (|A.21|) that 
(A.25) 



fe=i 



a.s. 



Moreover, it is assumed that the sequence (X n ) has a finite moment of order a > 2 
which ensures that 



sup ||Xfc|| = o(n 

Kfc<n 



I/ON 



a.s. 



Consequently, we find from Lemma < \2.1 ) that 



(A.26) 

where we can choose 



Kn| = o{b n ) a.s. 



K = » 



1/a 



log (log n) 



n 



Therefore, we clearly have 

n 

(A.27) ^W k {x)\ir k \I { \ nk \ >bk} < +00 

k=l 

Furthermore, it is not hard to see that 

n 

^b k = o(n 1/a V«log(log 



a.s. 



n, 



k=l 



Hence, via the same lines as in the proof of (|A.21|), we obtain that 
(A.28) 



J2hW k (x) = 0(n 1/a ^n\og(\ogn] 



a.s. 



k=l 



Then, we deduce from the conjunction of flA.24| ), ( |A.25| ), ( |A.27[ ), and ( |A.28[ ) that 

|E„(.r)| 0( //'-" ) + o(V /c y™iog(iog'< 



(A.29) 



; n 



a.s. 



Consequently, we infer from (A. 21) and (A.29) that for all x G 
(A.30) Q n (x) = O (n 1 -^ + O (n^y/nlogQog 



n. 



a.s. a.s. 



Finally, we can conclude from (A.l) together with (A. 12), (A. 14) and (A.30) that 

lim f n (x) = f(x) a.s. 



with the almost sure rates of convergence given by (3.2) and (3.3), which completes 
the proof of Theorem 3.1 □ 
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We already saw that (P n (x)) is a square integrable martingale difference sequence 
with predictable quadratic variation satisfying 



lim 



<P(x)>, 



n— >oo fl 



1+a 



1 + a 



h(9,x) 



a.s. 



In order to establish the asymptotic normality of Theorem |3.2[ it is necessary to 
prove that the sequence (P n (x)) satisfies the Lindeberg condition, that is for all 
e > 0, 



(B.l) -P n (x) = —J2®[\^P k (x)\ 2 I 



k=l 



L {|AP fc (x)|>ev / »?+"} 



V 







where AP n (x) = P n (x) — P n _i(x). We have assumed that the sequence (e n ) has a 
finite conditional moment of order b > 2 which means that 



supE[|£: n | < +oo 

n>0 



a.s. 



Consequently, for all e > 0, we have 
VJx) < 



(B.2) 



< 



1 


e b - 


-2 n c 




1 


e b - 


-2 n c 




1 


& 


~ 2 n° 



k=l 



k=l 



fc-ll 



^ L Kk<n 



k=l 



where c = 6(1 + a)/2. In addition, via the same lines as in the proof of (A. 8), we 
obtain that 



k=l 



+ a(b- 1) 



a.s. 



where 



K b (x) dx. 



Therefore, we deduce from (B.l) together with (B.2) and (B.3) that, for all e > 0, 

V n {x) = 0(n d ) 



a.s. 



where d — (2 — 6)(1 — a)/2. We recall that b > 2 which means that d < 0. It ensures 
that the Lindeberg condition is satisfied. Hence, it follows from the central limit 
theorem for martingales given e.g. by Corollary 2.1.10 of [S] that for all ieR, 

Pn(x) C 



(B.4) 



AM 



2 2 

1 + a 



h(e,x] 
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Furthermore, as soon as a > 6 and 1/3 < a < 1, we clearly obtain from (A. 30) that 
(B.5) lim = a.s. 



Finally, we find from (A.l) together with (A. 12), (B.4), (B.5) and Slutsky's lemma 
that, for all iGl, 

>^c/.w-/W)^v(o 1+ g ) 



which acheives the proof of Theorem 3.2 □ 
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